The Relational Vector-space Model and Industry Classification
نویسندگان
چکیده
This paper addresses the classification of linked entities. We introduce a relational vectnr-space (VS) model (in analogy to the VS model used in information retrieval) that abstracts the linked structure, representing entities by vectors of weights. Given labeled data as background knowledgdtraining data, classification procedures can be defined for this model, including a straightforward, "direct" model using weighted adjacency vectors. Using a large set of tasks from the domain of company affiliation identitication, we demonstrate that such classification proccdurcs can be effective. We then examine the method in more detail, showing that as expected the classification performance correlates with the relational autocorrelation of the data set. We then turn ~-~~~~~~ the tables and use the relational VS scores as a way to analy7.dvisualize the relational autocorrelation present in a complex linked structure. The main contribution ofthe paper 1s to introduce the relational VS model as a potentially useful addition to the toolkit for relational data mining. It could provide useful constructed features for domains with low to moderate relational ~~~ ~ ~~ ~ autocomelation; it may be effective by itself for domains with high levels of relational autocorrelat~on, and it provides a useful abstraction for analyzing the properties of linked data. General Terms Algorilhms, Performance, Design, Experimentation
منابع مشابه
Space Vector Modulation Based on Classification Method in Three-Phase Multi-Level Voltage Source Inverters
Pulse Width Modulation (PWM) techniques are commonly used to control the output voltage and current of DC to AC converters. Space Vector Modulation (SVM), of all PWM methods, has attracted attention because of its simplicity and desired properties in digital control of Three-Phase inverters. The main drawback of this PWM technique is 
its complex and time-consuming computations in real-time ...
متن کاملSpace Vector Modulation Based on Classification Method in Three-Phase Multi-Level Voltage Source Inverters
Pulse Width Modulation (PWM) techniques are commonly used to control the output voltage and current of DC to AC converters. Space Vector Modulation (SVM), of all PWM methods, has attracted attention because of its simplicity and desired properties in digital control of Three-Phase inverters. The main drawback of this PWM technique is its complex and time-consuming computations in real-time im...
متن کاملAn Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملMacroeconomic Shocks and Malaysian Tourism Industry: Evidence from a Structural VAR Model
Abstract his study employs a structural vector autoregression (SVAR) model to investigate the macroeconomic shocks on Malaysian tourism industry, especially how the economy dynamically responds to oil price shocks, exchange rates, changes in price level, exports, economic growth and tourism income during the study time period from January 2001 to December 2012. The results indicate that oil...
متن کاملAn Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003